SEQUENCE LISTING 



GENERAL INFOiyVIATION : 



(i) APPLICANT: ANDERSON, Darrell R. 

HANNA, Nabil 
LEONARD, John E. 
NEWMAN, Roland A. 
REFF, Mitchell E. 
RASTETTER, William H. 

A«TI3K» FOR THE XRE^T^T OF = CEI.I. 

LWPHOMA 



(iii) NUMBErNdF SEQUENCES: 11 



iA^r\ r-ORPPSPON^ENCE ADDRESS: ^ t 

(,v) 3^s, DOANE, SWECKER . MATHIS, L . I. - P - 

(B) STREEt\p.O. box 14 04 

(C) CITY: Alexandria 

(D) STATE: Vic^inia 

(E) COUNTRY: Uikted States 

(F) ZIP: 22313-1^4 

(v) COMPUTER READABLE 

(A) MEDIUM TYPE: Flotoy disk 

(B) COMPUTER: IBM PC dompatible 

(C) OPERATING SYSTEM: pV DOS /MS -DOS 

(D) SOFTWARE: Patentin R^ease #1-0, Versxon #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 0^/921,060 

(B) FILING DATE: 29-AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: Xoo 
CA) APPLICATION NUMBER: US 08/149,0,99 
(B) FILING DATE: 03-NOV-1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/978,891 

(B) FILING DATE: 13 -NOV- 1992 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Teskin, Robin L. 

(B) REGISTRATION NUMBER: 35,030 

(C) REFERENCE/DOCKET NUMBER: 012712-432 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 836-6620 

(B) TELEFAX: (703) 836-2021 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GGGAGCTTGG ATCGATCCTC TATGGTT 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS:^ 

(A) LENGTH: 8540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
O„O0 CCTCC^^ ..CCTCC.C. =T.™ »™3 

™o.c =™cc xc.™ t.™t ™cc. ™30c 
„3.o c^c^o =3».™.o 33ca.....e .C~3 ™.CT 

™CTO .CT»T^<= T™TC T.CCTOCX3. ^O.OCCXa^ 

„C.C .CC™ T..CX»XT. .C~C TTT.C.T.CX TCXOCCXOCT 

^.oc™ ooo^crrrcc .c.cccx»c ™- tc™t x^ttccccx 

.3X™ .OX^^TCW X.C=3a=TC. ™<=TTC.» OC^T 

™»= ™™ -cccoccT oocxo^ccoc cc»co.c<:c ™™ 

™t3T TCC™ ™- 03.™ ~C» 
„.CT «CX3CCC.C X™.C .XC^^TOT. TC.™ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 66 0 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 72 0 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 78 0 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 84 0 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 96 0 

CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 102 0 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 126 0 

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 132 0 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 13 80 

TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 144 0 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 174 0 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 1800 

TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 18 60 

GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 

CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 1980 

GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 204 0 

TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 

ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 2220 

GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 22 80 
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^COCCCT» CCA=C=OCOT OC.C.CCTTC CC^CTOTCC T.C.OTCCTC A.O.CTCT.C 
.CCCT«<=C. aC3T<3.T<3.C COT.CCCTCC .OC.OCrr<== OC»CCCA..C CT.C.TCT.3C 
;^C3T3;^TC .C«.=CCC.O C«C.CC»0 aTOa.C«0» ^OC.O.OCC C^TCTTGT 
O^C^CTC .CCOTCCCC. <=C.CCT3«C TCCT0<=003« .CCOTC.CTC 

rrCCTCTTCC CCCCA^^CC C^^.C.CC CCTO^TC CCCG..CCCC T.A,K.TC.C. 

™a.CO...G C«Ca«=- .OTTC^^ACTO OT.CGTOa.C 

03C0T^.33 TOCT^TOC C»O.C^ CCaC«=0.03 .OC^T.C» COCCO^ 
COT^T^TC COTCCTOCC C^^CT^ TO«T.=C« 33.CT.C«. 

T3C«>33TCT CC»C»^3C CCTCCC.3CC CCC.TC3.3. ^^CCATCTC C;^3CC«* 
.03C»0CCCC 0.3^CC.C. 33T3T»C.CC C^CCCCC.T CCC33=.T3. 3C.3.CC.33 
„CC.33TC. 3CCT3.CCT3 CC^TC^^ CC.3C3.C.T C3CC0T03.3 

^33.3.3C» .T033C.3CC 33.3«C^C ..C^O.CC. C3CCTCCC3T 3CTa3.CTCC 
3.C33CTCCT TCTTCCTCT. C.3C»0CTC .CC3X33AC;> A3;.0C.33T3 3C^3C.33=3 
„C3TCTTCT C.T3CTCC3T 3;.T3«T3A3 3CTCT3C.C. .CC.CT.C^C 3CA3;«3.3C 
CTCTCCCT3T 3TCC333T» AT3A33.TCC 3™C30TT .CC^^CTACC T.G»CT33AT 
.T3C33CC3. 3.T.TCT.C3 T.^C CXCa.CT3X3 CCT^ 
3CC.3CC.TC T3TT3TTT3C CCCTCCCCC3 ^CCTTCCTT 3.CCCT33» 33T0CC.CTC 
CCCT3TCCT rrCCT»T» «.T3;.33«* ri.C.TC3C. TT3TCT3.3T .33T3TCATT 
C™TCT3a3 333T3333T3 3333.33.3^ 33^33333. 33.TT333;. 3.3»T.3C» 
333.T33T03 33.T3C33T3 333^3^33 »3C.3C.33 333T33»C.O C33T33.T3. 
33C3.T3C33 ;.33rrr3CTT 3T3«TTT3T T.TTT3C.T. .T3.3«^ — 
«™C. 3C«^C.3T .3™.TT3. 33^T3C3T T33C^3 3.T33.™3 
.0.3.3..TT 3.3T33.3.3 .T^33.C« .3.™TT3. 3.333.3T.C C3.3.33^ 
3.3TC3T«3 33.3T3.3T3 3C»C^3ATT CT.333^» .T.T33TT3T C.T3A333*. 
333X3.^3 3T.3.33C.3 .CCTTOGT^ 33333»T3T 33T3A3A3;.3 3.T.3.3.33 
33.33.3C3. 3333.3.33. T.T»33T3. 33T.33..C 3.T33TC3T3 .C.™ 
3_3T T3T3TT333. 33TT33.T.0 3TT33.3.33 .3.3333T3C 3.^3 
3«.3TT3.C 33C«T33T. 33G.«»333 T33T.33.T. T..T333C3C T3C3.^T3 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 402 0 

GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 

TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 414 0 

CCTGAGAACA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 42 00 

GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4260 

GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4320 

TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 43 8 0 

GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 444 0 

GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4 500 

GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 456 0 

TCATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 

TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 468 0 

TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 4 740 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 48 00 

CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4 86 0 

TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4 92 0 

ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 

GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 

CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 516 0 

AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 522 0 

TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 

TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 534 0 

CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 

CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 546 0 

CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 552 0 

GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 

GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 564 0 
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CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 
TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 
CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 
CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 
GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 
GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTTCCCGATT 
CGCAGCGCAT CGCCTTCTAT CGCCTTCTTG ACGAGTTCTT CTGAGCGGGA CTCTGGGGTT 
CGAAATGACC GACCAAGCGA CGCCCAACCT GCCATCACGA GATTTCGATT CCACCGCCGC 
CTTCTATGAA AGGTTGGGCT TCGGAATCGT TTTCCGGGAC GCCGGCTGGA TGATCCTCCA 
GCGCGGGGAT CTCATGCTGG AGTTCTTCGC CCACCCCAAC TTGTTTATTG CAGCTTATAA 
TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA 
TTCTAGTTGT GGTTTGTCCA AACTCATCAA TCTATCTTAT CATGTCTGGA TCGCGGCCGC 
GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT 
CCGCTCACAA TTCCACACAA CATACGAGCC GGAGCATAAA GTGTAAAGCC TGGGGTGCCT 
WGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 
ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 
TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 
OAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 
CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 
GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 
CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 
TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 
CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 
AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 
AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 



5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 
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6780 
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7200 
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GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 73 80 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 744 0 

GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 7500 

GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7 56 0 

TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 762 0 

TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 76 80 

TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 774 0 

GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7800 

GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7 860 

TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7 92 0 

CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7 980 

TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8 04 0 

CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 

AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 

CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 822 0 

AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 8280 

AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 8340 

GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 84 00 

GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 84 60 

TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 852 0 

TTCCCCGAAA AGTGCCACCT 8540 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 92 09 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 18 0 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 24 0 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 54 0 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 6 00 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 66 0 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT ACCGGTTTGA CTCACGCGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 84 0 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 

GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 114 0 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 126 0 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 132 0 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 13 80 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 144 0 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 
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CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 
AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 
XGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 
OGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 
TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 
OCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 
TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 
;,TGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 
GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 
TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 
OGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 
CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG 
CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 
TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 
GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC 
^^TCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 
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CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 
GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 
TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 
AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 
GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 
;^GCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 
CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 
GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 
CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 
CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 
CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT 
AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 
CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC 
GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 
AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 
ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC 
CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 
ATCACCGAAG CCTGATTCCG TAGAGCCACA CCTTGGTAAG GGCCAATCTG CTCACACAGG 
ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 
CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 
ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 
GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT CCCAAAATAT GGGGATTGGC 
AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG 
ACCACAACCT CTTCAGTGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG 
TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA 
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GAACTCAAAG AACCACCACG AGGAGCTCAT TTTCTTGCCA AAAGTTTGGA TGATGCCTTA 4 92 0 

AGACTTATTG AACAACCGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCGGAGGC 4 980 

AGTTCTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG 5040 

ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 5100 

CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 516 0 

AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCTGCTCCC 522 0 

CTCCTAAAGC TATGCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC 5280 

GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 5340 

CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG 54 00 

TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 5460 

TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGAAC CAGCTGGGGC 5520 

TCGAGCTACT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 5580 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 5640 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 5700 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 5760 

GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 5820 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 5880 

CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA 5940 

CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 6000 

AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT 6060 

TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC 612 0 

GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 618 0 

AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 624 0 

TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC 63 00 

GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC GTACTCGGAT 6360 

GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC TCGCGCCAGC 6420 

CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA 64 80 

TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA 654 0 
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C.0^.CC3= CT<^OX.TOO COO^CCOCT. .C^.^^T. OCOT^C. CCC3T<=™T 
T3CT.«0;.. C-™OCa.C. ^..OOCrO. CCaCTTCCTC OTOCTTT^CO OT.TCGCCCC 
C.3COC.TC3 CCTTCTAT33 CCTTC.«=AC 0.3TTC.TCT 0.3C333.CT 
CT3333PrC3 «U.T3.CC3. <:C«3C3AC0 CCC^CCTOC C.TC.C3.G. TTTCGATTCC 
.CC3CC3CCT ^..ra^O 3TT333CTTC G3»TC3TTT TCCG33.C3C 333CT30.T0 
.TCCTCC.3C 3C3333ATCT CAT3CT33A3 TTCTTC3CCC .CCCC«.CTT GTrT.TTGC 
3c™T«T3 3TT.C».T. «3C;^T.3C .TCC^TT TC.C»»T« .OC.TTTTTT 
TC.CT3CATT ^3^X33 TTT3TCC». 3TCATCA.TC T.TCTT.TCA T3TCT33*TC 
0C30CC3C3. TCCC3TC3.3 .0CTT33C3T ^TC.T33TC AT.3CT3TT. CCT0T3T0« 
.TT3TT.TCC 3CT»C;^TT C»CAC»» T.C3.3CC33 »OC.T«AO xaT».3CCT 
03a3T3CCT. .T3.3T3.3C T»CT3.»T T^TT3C3TT 3C3CTC.CT3 CCC30TTTCC 
.3TC330.» CCT3TC3T0C C.3CT3CATT »T3;^TC33 CC;>.C0C3CG 333A3.33C3 
3T™C3T.T T333C3CTCT TCCGCrTCCT C3CTC.CT3. CTCOCT3C3C TC33TCOrrC 
33CT3C33C3 .0C33T.TC. 3CTCACTC» A33333T»T ;.C33™tCC .»G»TC.3 
333AT«.C3C .33».3»C «3T3.3C» «333C.3C. «^33CC.33 ^CC3T^ 
.33CCGC3TT 3CT33C3TTT TTCC;.™33C TCC3CCCCCC T3.C0;.3C.T CAC«^TC 
3.C3=TC»0 TCA3.33T33 C3^CCC3. CA3G.CT.T. ;^3.T.CCA3 3C3TTTCCCC 
CT33«.3CXC CCTC3T3COC TCrC<:T3TTC C3.CCCT3CC 33^3033* T.C3T3TC33 
CCTTTCTCCC TTC333^3C 3T33C3CI^ 3T3WG3TC ACG3T3T;.3G TATCTCAGTT 
C3GT3T.G3T CGTT3GCTCC «3CTG3GCT 3T3T3C.CG. .CC3CCC3TT C.GCCC3.CC 
3CTGC33CTT .T3C3GT«.C TATC3TCTr3 .GTCC;.A3CC G3T«.3»C.C G.CTT.TC3C 
C.CT.3CA3C A3C3.CTG3T «.C.33.TT. 3C.G.GCG.G 3T.T3T;.3GC 33T3CT.C.G 
.GTTCTTG;^ 3TG3T3GCCT »CTACG3CT ;.C.CTAG»G G.C.GT.TTT 3GT.TCT3C3 
CTCT3CI^ 3CCA3TTACC TTCG3A«^ GA3TT33™3 CTCTTGATCC 33C«ACA» 
C«CCGCr33 TA3C3GTG3T TTTTTTGrTT 3C«GCA3C. 3ATTA3GCGC .G«^3 
3.T3TC«3A A3.TC3TrrG .TCTrTTCT. C3G3GTCTG;. C3CTCA3T33 «.CG^CT 
CCGTT^GG 3«TTTG3X3 ;.T3.3.™T C«^G3;.T CTTC.CCT.3 ATCCTTTT«. 
..T;^TG »GTTTT»* TC«.TCT»A GT™T.T3. GT^C™3 TCTG.C.GTX 
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.CC«X.CTT »T»=^- =-CCT.TCT C.CO.TCTO TCT.TTTC3T TCTCC.T.O 
™CCTa.CT -T.CO.O. ...CTT.CC. TC^CCCC. 

O.OCT=C«X O.CCC.C3CT C.CC30CTCC ^.^X^.C. 3C«T^CC 

^003CC3.0 -CCTOC^C TXT.TCCOCC TC^T 

^.-rr^-rro xroccooo;^ ocT.=.oT^ o-ottcocc A.rr»T;^T tt.coc»c. 

™.™C»T T0CT»C..3C .TCOT^OTOT C.COCTCOTC 0TTT30T.T3 OCTTCT^ 
3CTCC33rrC CC«C<^TC. .30C«S™ »T3.TCCCC C.T=TT3T0C »»«.0<:30 
™3C.CC^ C33TCCTCC3 .TCO^TC. 3»3T«3TT 33CC3C.3T3 TT.TC.C.C. 
.33TT.T33C .OC.CT3C.. ^TTCTC™ CTOTC.^CC .T=C3X^3. T3CTTTTC.3 
™.CT33T3. 3T.CTC«.C »3TC.T.C. 3.3^T.3.3 T.T3C33C3. CC3.aTX0CT 

CTT3CCC33C 3TC^T.C3. 3.T«T.CC0 C3CC.C.T.3 C.3»CTTT. «.3T3CTC. 

.C.TT33^ AC3TTC™3 333C3«^C TCTC»33^T CTT.CC3™ TT3.3.TCC. 

3.™. ™C3T OCCCC^CT 3.^™ ™.CT T.C.CC.3C3 

„CT333T3 .33^3. G3«.33C»» .T3CC3C»^ 

<^.3TT3 CTCTTCCTTT TT3W.™ TT3»3C.rr ..TC.333TT 

.^XCTCT 3.3330.X.C AT.T.TO^T 3T.™3^ -AT^C» .T.3333rrC 

CGCGCACATT TCCCCGAAAA GTGCCACCT 
C2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATCACAGATC TCTCACCATG GATTTTCAGG TGCAGATTAT CAGCTTC 



47 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 ; 
TGCAGCATCC GTACGTTTGA TTTCCAGCTT 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. -384 

(ix) FEATURE: 

(A) NAME/KEY: mat_pepticie 

(B) LOCATION: 67.-3 84 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
..3 ... ^ CO .XT ..C ..C ™ C.. ..C .O. OCT TC. 

Met ASP Phe Gin Val Gin He lie Ser Phe Leu 

Z .T. 1 TCC .0. 033 ^ .TT OTT CTC TCC TCT CC. OC. .TC 

Val He Met Ser Arg Gly Gin lie vax ^ 
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4 



- iz s I- s ir. -I z zi ?s zi - z s s 

15 

s s ?s in z - s s s 
s s ^ s z -I ^ 3- s i -J f.^ 

- C3C I .0. OOC JO. .C. 3^ .CX .C. .^C .C. C.C .C. ..C 

Val Arg Phe Ser Gly ber ^^xy o j 
60 

..C .OA 0.0 0.0 OCT OAA OAT OCT OCC ACX TAT TAC TOC CAO CAG TOO 
ser Arg val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr cy 
75 80 85 

... ..P rrA CCC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATC AAA 
ACT AGT AAC CCA CCC AL.^^ ixu Lys 
Thr ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys 
95 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

.1 v.i nn He He Ser Phe Leu Leu He Ser Ala Ser 
Met Asp Phe Gin Val Gin lie ±xe 

-22 -20 

val lie Met Ser Arg Gly Gin He Val Leu Ser Gin Ser Pro Ala lie 
-5 1 ^ 

Val Thr Met Thr Cys Arg Ala Ser 



Leu ser Ala Ser Pro Gly Glu Lys Vai inr ...^ . 

15 

s„ ser V.1 S„ Tyr He His Trp Phe Oln 31n Lye «o Oly ser ser 

30 

Pr. .y, pro Trp He Tyr .1. Thr Ser .en Leu Al. Ser Gly V.l Pro 

45 

vex Ar^ Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr ser Leu Thr II. 

Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 



60 ^5 



144 



192 



240 



288 



336 



384 



Ser Arg Val Glu Ala Glu Asp 

- 80 



• 



75 ^° 

^■1 ^^^r rlv Thr Lys Leu Glu He Lys 
Tt.r Ser Asn Pro Pro Thr Phe Gly Gly Gly 

95 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GCGGCTCCCA CGCGTGTCCT GTCCCAG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

fix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3 nKr.icleotide 3 is N wherein N 
(D) OTHER INFORMATION: /note= ' Nucleotia 

is G or C." 

(ix) FEATURE: f^;,ture 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 18 n^ucleotide IB is N wherein 
(D) OTHER INFORMATION: /note- 

N is A or C." 
(ix) FEATURE: 
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(A) NAME/KEY: misc_feature 

\l\ S?Sr?NFORL.TION: /note= "Nucleotide 19 is N wherein 

N is A or G. " 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

lol SSr™PO^TI0»: /note. -Nucleotide 25 Is » wherein 

N is G or A. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

29 

GGNTGTTGTG CTAGCTGNNG AGACNGTGA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..42 0 

(ix) FEATURE: 

(A) NAME/KEY: mat_pept ide 

(B) LOCATION: 58.. 420 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

^nn PTr ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT 

z z - - - - 

- 1 5 - 1 0 

i Zl I- 1- - 1- -I I- S S Zl Zl - 

1 5 10 

CCT GGG GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT 
III Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr 
15 20 25 

ACC AGT TAC AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG 
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Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
30 35 40 45 

GAA TGG ATT GGA GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAG AAT 24 0 

Glu Trp He Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
50 55 60 

GAG AAG TTC AAA GGC AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC 288 
Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
65 70 75 

ACA GCC TAG ATG CAG CTC AGC AGC CTG ACA TCT, GAG GAC TCT GCG GTC 336 
Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
80 85 90 

TAT TAC TGT GCA AGA TCG ACT TAG TAG GGC GGT GAC TGG TAG TTC AAT 3 84 

Tyr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn 
95 100 105 

GTC TGG GGC GCA GGG AGC ACG GTC ACC GTC TCT GCA 42 0 

Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala 
110 115 120 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Gly Trp Ser Leu lie Leu Leu Phe Leu Val Ala Val Ala Thr Arg 
-19 -15 -10 -5 

Val Leu Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 
1 5 10 . 

Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
15 20 25 

Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
30 35 40 45 

Glu Trp He Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
50 55 60 

Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
65 70 75 

Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
80 85 90 
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Tvr Tvr Cys Ala Arg Ser Thr Tyr Tyr Gly «ly Asp Trp Tyr Phe Asn 
95 100 \ 105 



ii a: 



Val Trp Gly Ala Gly Thr Thr Val Thr Val SeV Ala 

110 
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